Rank in Wordlist | Frequency | Word |
---|---|---|
2117 | 71 | 1,5 |
3834 | 38 | 1,2 |
4025 | 36 | 2,5 |
4489 | 32 | 0,5 |
4719 | 30 | 1,3 |
4844 | 29 | 1,8 |
5016 | 28 | 3,5 |
5503 | 25 | 1,7 |
5713 | 24 | 1,4 |
5912 | 23 | 2,2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
2458 | 61 | 10% |
2853 | 52 | 20% |
3014 | 49 | 100% |
3561 | 41 | 50% |
3931 | 37 | 30% |
4255 | 34 | 15% |
4492 | 32 | 5% |
4721 | 30 | 3% |
4843 | 29 | 1% |
5504 | 25 | 40% |
Rank in Wordlist | Frequency | Word |
---|---|---|
2822 | 53 | S&P |
4927 | 29 | Standard & Poor's |
7149 | 19 | S&P 500 |
12254 | 10 | Lindt & Sprüngli |
13266 | 9 | Johnson & Johnson |
14154 | 8 | & Co |
14431 | 8 | Ernst & Young |
14542 | 8 | H&M |
15758 | 7 | AT&T |
16614 | 7 | Roth & Rau |
Rank in Wordlist | Frequency | Word |
---|---|---|
103438 | 1 | M$-Fanboy |
130016 | 1 | US$/CHFr |
130017 | 1 | US$Dollar |
158466 | 1 | µ$oft-Windows-Phone |
Rank in Wordlist | Frequency | Word |
---|---|---|
695 | 214 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
3747 | 39 | 10'000 |
3866 | 38 | Moody's |
4916 | 29 | Poor's |
4927 | 29 | Standard & Poor's |
5013 | 28 | 100'000 |
5913 | 23 | 20'000 |
6799 | 20 | McDonald's |
7002 | 19 | 30'000 |
7234 | 19 | geht's |
7664 | 17 | 200'000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
5200 | 27 | Google + |
7796 | 17 | Kühne + Nagel |
16217 | 7 | Huber+Suhner |
16342 | 7 | Kühne+Nagel |
32650 | 3 | K+N |
34593 | 3 | Schmolz + Bickenbach |
39209 | 2 | 1+1 |
47013 | 2 | Kühne + Nagel International |
50138 | 2 | Rio+20 |
63778 | 1 | 5+1 |
Rank in Wordlist | Frequency | Word |
---|---|---|
6889 | 20 | awp/sda |
8408 | 16 | km/h |
9499 | 14 | und/oder |
9546 | 13 | 2011/12 |
9556 | 13 | 9/11 |
9562 | 13 | AWP/sda |
11293 | 11 | KEYSTONE/AP |
12041 | 10 | DEVISEN/Euro |
12933 | 9 | 2010/11 |
14396 | 8 | EUROPA/Ausblick |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots